The ubiquity of the Simpson’s Paradox

نویسنده

  • Alessandro Selvitella
چکیده

Correspondence: [email protected] Department of Mathematics and Statistics of McMaster University, 1280 Main Street West, Hamilton, (ON) L8S-4K1, Canada Abstract The Simpson’s Paradox is the phenomenon that appears in some datasets, where subgroups with a common trend (say, all negative trend) show the reverse trend when they are aggregated (say, positive trend). Even if this issue has an elementary mathematical explanation, it has a deep statistical significance. In this paper, we discuss basic examples in arithmetic, geometry, linear algebra, statistics, game theory, gender bias in university admission and election polls, where we describe the appearance or absence of the Simpson’s Paradox. In the final part, we present our results concerning the occurrence of the Simpson’s Paradox in Quantum Mechanics with focus on the Quantum Harmonic Oscillator and the Nonlinear Schrödinger Equation. We discuss how likely it is to incur in the Simpson’s Paradox and give some concrete numerical examples. We conclude with some final comments and possible future directions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computational Social Scientist Beware: Simpson's Paradox in Behavioral Data

Observational data about human behavior is often heterogeneous, i.e., generated by subgroups within the population under study that vary in size and behavior. Heterogeneity predisposes analysis to Simpson’s paradox, whereby the trends observed in data that has been aggregated over the entire population may be substantially different from those of the underlying subgroups. I illustrate Simpson’s...

متن کامل

How Likely is Simpson's Paradox in Path Models?

Simpson’s paradox is a phenomenon arising from multivariate statistical analyses that often leads to paradoxical conclusions; in the field of e-collaboration as well as many other fields where multivariate methods are employed. We derive a general inequality for the occurrence of Simpson’s paradox in path models with or without latent variables. The inequality is then used to estimate the proba...

متن کامل

Simpson’s paradox, moderation, and the emergence of quadratic relationships in path models: An information systems illustration

While Simpson’s paradox is well-known to statisticians, it seems to have been largely neglected in many applied fields of research, including the field of information systems. This is problematic because of the strange nature of the phenomenon, the wrong conclusions and decisions to which it may lead, and its likely frequency. We discuss Simpson’s paradox and interpret it from the perspective o...

متن کامل

Simpson’s Paradox in the interpretation of “leaky pipeline” data

The traditional ‘leaky pipeline’ plots are widely used to inform gender equality policy and practice. Herein, we demonstrate how a statistical phenomenon known as Simpson’s paradox can obscure trends in gender ‘leaky pipeline’ plots. Our approach has been to use Excel spreadsheets to generate hypothetical ‘leaky pipeline’ plots of gender inequality within an organisation. The principal factors,...

متن کامل

Simpson’s Paradox – A Survey of Past, Present and Future Research

Simpson’s paradox refers to the reversal of a statistical relationship between two variables in sub-populations when the sub-populations are combined and analyzed as a population. This article is intended to provide a broad survey of the past, present and future research surrounding the issue. Real data from a discrimination litigation case is examined to identify the occurrence of the paradox....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017